Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Software tools and test data for research and testing of page-reading OCR systems

Identifieur interne : 001373 ( Main/Exploration ); précédent : 001372; suivant : 001374

Software tools and test data for research and testing of page-reading OCR systems

Auteurs : Thomas A. Nartker [États-Unis] ; Stephen V. Rice [États-Unis] ; Steven E. Lumos [États-Unis]

Source :

RBID : Pascal:05-0361379

Descripteurs français

English descriptors

Abstract

We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that is available and gives the URL where they can be located.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Software tools and test data for research and testing of page-reading OCR systems</title>
<author>
<name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">05-0361379</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 05-0361379 INIST</idno>
<idno type="RBID">Pascal:05-0361379</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000454</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000334</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000380</idno>
<idno type="wicri:doubleKey">1017-2653:2005:Nartker T:software:tools:and</idno>
<idno type="wicri:Area/Main/Merge">001410</idno>
<idno type="wicri:Area/Main/Curation">001373</idno>
<idno type="wicri:Area/Main/Exploration">001373</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Software tools and test data for research and testing of page-reading OCR systems</title>
<author>
<name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Accuracy</term>
<term>Algorithm</term>
<term>Availability</term>
<term>Data gathering</term>
<term>Document image processing</term>
<term>Optical character recognition</term>
<term>Performance evaluation</term>
<term>Program verification</term>
<term>Reading device</term>
<term>Software tool</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Outil logiciel</term>
<term>Appareil lecture</term>
<term>Reconnaissance optique caractère</term>
<term>Disponibilité</term>
<term>Traitement image document</term>
<term>Evaluation performance</term>
<term>Algorithme</term>
<term>Collecte donnée</term>
<term>Précision</term>
<term>Vérification programme</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that is available and gives the URL where they can be located.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Nevada</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Nevada">
<name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
</region>
<name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001373 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001373 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:05-0361379
   |texte=   Software tools and test data for research and testing of page-reading OCR systems
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024